On Combining Decisions from Multiple Expert Imitators for Performance
نویسندگان
چکیده
One approach for artificially intelligent agents wishing to maximise some performance metric in a given domain is to learn from a collection of training data that consists of actions or decisions made by some expert, in an attempt to imitate that expert’s style. We refer to this type of agent as an expert imitator. In this paper we investigate whether performance can be improved by combining decisions from multiple expert imitators. In particular, we investigate two existing approaches for combining decisions. The first approach combines decisions by employing ensemble voting between multiple expert imitators. The second approach dynamically selects the best imitator to use at runtime given the performance of the imitators in the current environment. We investigate these approaches in the domain of computer poker. In particular, we create expert imitators for limit and no limit Texas Hold’em and determine whether their performance can be improved by combining their decisions using the two approaches listed above.
منابع مشابه
A Novel Confidence-Based Framework for Multiple Expert Decision Fusion
A novel confidence-based parallel multiple expert decision combination framework is introduced. The traditional approaches to parallel multiple source decision fusion either take no account of the confidence of each decision by each participating expert in the combined framework or only exploit the confidences associated with each decision. But it is entirely possible to incorporate more additi...
متن کاملAutomated Detection of Multiple Sclerosis Lesions Using Texture-based Features and a Hybrid Classifier
Background: Multiple Sclerosis (MS) is the most frequent non-traumatic neurological disease capable of causing disability in young adults. Detection of MS lesions with magnetic resonance imaging (MRI) is the most common technique. However, manual interpretation of vast amounts of data is often tedious and error-prone. Furthermore, changes in lesions are often subtle and extremely unrepresentati...
متن کاملCombining data envelopment analysis and multi-objective model for the efficient facility location–allocation decision
This paper proposes an innovative procedure of finding efficient facility location–allocation (FLA) schemes, integrating data envelopment analysis (DEA) and a multi-objective programming (MOP) model methodology. FLA decisions provide a basic foundation for designing efficient supply chain network in many practical applications. The procedure proposed in this paper would be applied to the FLA pr...
متن کاملGoal Programming Optimization Model for Performance Management: A SCOR-Based Supply Chain Decision Alignment
This article develops an integrated model of transmitting strategies and operational activities to enhance the efficiency of supply chain management. As the second objective, this paper aims to improve supply chain performance management (SCPM) by employing proper decision-making approaches. The proposed model optimizes the performance indicator based on SCOR metrics. A process-based method is ...
متن کاملDependency Relationship Based Decision Combination in Multiple Classifier Systems
Although many decision combination methods have been proposed, most of them did not focus on dependency relationship among classiiers in combining multiple decisions. That makes classiication performance of combining multiple decisions be degraded and biased, in case of adding highly dependent inferior classiiers. To overcome such weaknesses and obtain robust classiication performance, the pres...
متن کامل